DeepGauge: Comprehensive and Multi-Granularity Testing Criteria for Gauging the Robustness of Deep Learning Systems
نویسندگان
چکیده
Deep learning defines a new data-driven programming paradigm that constructs the internal system logic of a crafted neuron network through a set of training data. Deep learning (DL) has been widely adopted in many safety-critical scenarios. However, a plethora of studies have shown that the state-of-the-art DL systems suffer from various vulnerabilities which can lead to severe consequences when applied to real-world applications. Currently, the robustness of a DL system against adversarial attacks is usually measured by the accuracy of test data. Considering the limitation of accessible test data, good performance on test data can hardly guarantee the robustness and generality of DL systems. Different from traditional software systems which have clear and controllable logic and functionality, a DL system is trained with data and lacks thorough understanding. This makes it difficult for system analysis and defect detection, which could potentially hinder its real-world deployment without safety guarantees. In this paper, we propose DeepGauge, a comprehensive and multi-granularity testing criteria for DL systems, which renders a complete and multi-faceted portrayal of the testbed. The in-depth evaluation of our proposed testing criteria is demonstrated on two well-known datasets, five DL systems, with four state-of-the-art adversarial data generation techniques. The effectiveness of DeepGauge sheds light on the construction of robust DL systems.
منابع مشابه
Fuzzy analytical network process logic for performance measurement system of e-learning centers of universities
This paper proposes an efficient performance measurement system to evaluate the excellence of e-learning centers of universities. The proposed system uses the analytic network process (ANP) as an effective multi-criteria decision making (MCDM) method and its fuzzy mode to respond to uncertainties in judgements. This system also needs a targeted and systematic criteria set which is collected thr...
متن کاملComprehensive Multi-Criteria Comparison and Ranking of Natural Gas Liquefaction Process by Analytic Hierarchy Process (AHP)
Several processes have been proposed for natural gas liquefaction due to the vast utilization of LNG as a reliable and relatively easy to use fuel. Even though the merits and demerits of different process have been studied, a dearth of comprehensive technical and economical comparative investigation of these methods makes further broad examination a necessity. This article is presented to addre...
متن کاملComprehensive Decision Modeling of Reverse Logistics System: A Multi-criteria Decision Making Model by using Hybrid Evidential Reasoning Approach and TOPSIS (TECHNICAL NOTE)
In the last two decades, product recovery systems have received increasing attention due to several reasons such as new governmental regulations and economic advantages. One of the most important activities of these systems is to assign returned products to suitable reverse manufacturing alternatives. Uncertainty of returned products in terms of quantity, quality, and time complicates the decis...
متن کاملBuilding a maintenance policy through a multi-criterion decision-making model
A major competitive advantage of production and service systems is establishing a proper maintenance policy. Therefore, maintenance managers should make maintenance decisions that best fit their systems. Multi-criterion decision-making methods can take into account a number of aspects associated with the competitiveness factors of a system. This paper presents a multi-criterio...
متن کاملGovernance of HIV/AIDS: Implications for Health Sector Response
This paper reviews the essence of effective governance and importance of a multi-sectoral approach in generating health systems response to HIV/AIDS. This comprehensive approach highlights the importance of integrating reproductive sexual health programs and HIV prevention services, including peer education, life skills, and Voluntary Counseling and Testing (VCT), for Prevention of Mother–to-Ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018